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XaiG thesis piresents a p.cneral discussion of the problems involved 
in estimating t’le Circular Probable Crror, more coi.r.ior.ly referred to as 
the CSl’t A comparison is i.inde between the estirnates of the CEP under 
ttTO distinct models. The models are identical encept for the location 
of the m.ean vector in relation to the target. Tlie assumption of depen- 
dence is made in both iiodels and the resulting estimates are compared 
with the corresponding estimates obtained under the assumption of inde- 
pendence, Confidence interval • estimuites of the CEP are also presented, 

Tto m.ethods of rer.ioving outlier or "maverick” observations are intro- 
duced and some of the possible effects on t’;e estimated CEP are discussed, 
The diffci*ent estimating procedures are illustrated with t'-rce numerical 
problems. 



i i 






I ’ 



I 



i 




1 

I 

I 



i 



Tiic ter. . is fa. ill i 

assu.;.ptiou3 upor* uhidi tins r.^. 
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is the objective of this t’esis to cnplain 
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“!’.g C;i;r vas initially ccvclopecl in ordor to some criterion for 

r.ensrriug the cnpcctcc! ef fcctivencos of a particular vjeapon .syste;. and to 
give sor.o r.eans for comjiarins similar \:'capon syotcr.s or '..•canon.'j. In 
order to develop this criterion, it is essential that the assurptions 
used are '.'e3 1 understood and established, '-■h.s npproac’i ..;ost often used 
is to assjiuC that the errors in and across the line of si^ht are inde- 
pendent and that the variances are coyual uith the justification tint 
these assur.'.ptions produce a necliaiblc error. Itou’ever, an error may be 
introduced and it is necessary to at least understand ifaat is being 



assur.ed before makins judaet'.ent on the legality of any assu-.r.ption, fhis 
thesis therefore, attempts to explain such assumptions and to compare 
possible results of ^'.ahing certain ass"nptions in three c::a..,ple pr’oblcs. 
The problems are all ficticious and utilized onl^’ for t’ue purpose of 



explaining the estimating procedures and as.sur:ptions. 

The thesis is primarily directed at the reader ’?ith a college back- 
ground in calculus, some matrix theory, and some feel for basic pi'oba- 
bility and statistical procedures. The contents are arranged in six 
sections and three appendices. .lection I is an introduction to the prob- 
lem and the basic mathc.mati cal concepts \.'hich u-ill be used. Sections II 
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and III introduce the most commonly used estimating procedures. Section 
IV explains the problem of deleting outlying observations from the deter 
mination of the estimate. Section V introduces the confidence interval. 
Section VI is a sunrmary of the techniques used in the previous sections. 
Appendix A is concerned with the mathematical techniques vjhich are used 
to explain and transform the true orientation of the dependent variables 
Appendix 3 explains tX\ro methods of obtaining unbiased estimates of the 
Ci2P, Appendix C explains in detail the methods of integrations used. 

It is recomniended that Appendices A and C be studied before starting 
Section II, 

This thesis v?as written during the period January-June 1962 at the 
United States Naval Post Graduate School, Monterey, California, I v.’ish 
to express my gratitude to Professor J, 3, Borsting for his continued 
patience, encouragement, and most competent guidance while acting as 
faculty advisor, and to Professor Max v/oods for his continuous aid and 
technical understanding of the problem while acting as second reader, 

I also wish to thank my t^ife for the moral, clerical, and artistic 
assistance given me during the writing of this thesis as well as the 
past tvx) years. 
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INiMODlJCriOK TO r 

1.1 General Discussion of the Circular Probable Zrror 

riie problem of determining useful estimates of the parameters 
v;hich describe the distribution of the fall of shot about a target is 
directly related to the high cost of testing expensive v7eapon systems. 
Since relatively few tests are allowed because of this expense, it is 
not improbable that a good weapon system could be completely rejected 
because of inefficient utilization of the small airount of data available. 
Also, the size and yield of the warhead is directly related to the esti- 
mated parameters. If the estimated variance is large, the effective 
radius will also have to be large to cover the target complex, and in 
turn the missile will not be able to reach the range of the same missile 
with a smaller vjarhead. The most efficient use of the limited data will 
thus greatly reduce the risk involved in reducing the warhead size and 
increase the potential range. It also may aid in v;capon deployment or 
assignment to larger targets because of the greater confidence that can 
be placed in the estimates. It seems logical that if a great deal of 
confidence can be placed in the v;eapon, fewer weapons will have to be 
assigned to a target, thus releasing some weapons for other targets. 

The important point is that the confidence placed on the estimators 
must he high enough to reduce the risk involved and provide a sound 
basis for decision. 

One method, which is comi:'ionly used, to measure and compare the 
estimated parameters, is called the circular probable error or CEP 

method. The CEP is defined as the radius of the circle v:ith center 



1 



at (u^.u ) which includes 507o of a bivariate probability mass. The 
^ / 

illustration in figure (1) shows the form of this function. It is to 
be noted that most of the volume under the curve is centered at the tar- 
get and decreases as the distance increases from the target. This par- 
ticular function is well founded historically on the basis of the 
analysis of observations from long range gun fire. 




UNJe/K, -j- (a//) 



Bivariate Probability Mass 
Figure 1 

The bivariate normal distribution is a generalization of the normal 
distribution of a single variate and is bell shaped as shox^n in figure (1) 
above. Any plane parallel to the ;<.// plana that cuts the surface will 
intersect the bell in the elliptical cvtrve showi in figure (2), 
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Bivariate Density Function wliich has been 
Cut by a Plane Parallel to the x,y Plane. 



Figure 2 



Any plane perpendicular to the x,y plane v;ill cut the surface in 
a curve of the normal form as shown in figure (3). 



/ \ 






(^ X,Uy> 



Bivariate Density Function waich has been Cut 
by a Plane Perpendicular to the x,y Plane, 

Figure 3 

The bivariate density function actually represents a five parameter 

family of distributions, the parameters being the means (u ,u ) , the 

X y 

variances rr^» CT\ and the correlation coefficient F, This function is 

X y ' 

symvaetric about the means and has its greatest value at the point (u^,u^), 
It sliould also be noted that if the errors in the x and y directions are 
Independent and the variances equal, then the distribution will be in 
the shape of a boll with tv;o of the opposites sides "pushed in" an 



3 



equal amount 



The effect of the variance is siiowi in figure (4) 




Two Bivariate Density Functions with 

Different Variances about (u^,,u ); Side View 

y 

Figure 4 

If the variances are equal, a plane cutting the surface, as in 
figure (2), will intersect the bell in a circle. 

The height of the curve, forming the density function, at any 
point "a” is related to the probability of that point. Since this func- 
tion is continuous, the probability must be expressed in the form of an 
interval since the probability of any single point is zero, however, 
the probability that a u... v^.iai^ie X, in the distribution being 
considered, falls in an interval is equal to the area under the curve 
in the interval being considered. That is, the probability that 
a4X4b is equal to the area shown under the curve in figure (5). 

Note that since the area under the curve about the point (u^,u ) is the 
greatest, the probability that the random variable X fall in this interval 
is greater than that of an interval of equal length away from the point 
Cu^,u^^), This is shown in figure (6)« 
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Univariate Density Function 
Showing the Area Under Con- 
sideration When Determining 
P(a <X<b). 



Figure 5 




Univariate Density Function 
Showing the Areas Under Con- 
sideration In the Intervals 
(a,b) and (c,d) whore 
b-a = d-c. 

Figure 6 



1,2 Kathemati cal Ko tation 

X and Y are said to have a bivariate normal distributioxi if their 
joint density function, f^ given by 

(1.1) f„ „(x,y) = 1 expr- 1 __ [ / X - (-ix t delx~^(x\ /y- /y- ar ] 

1 A(/-e^nl w / \v; /y Vy/* , v^/. 



1 . 2.1 

The quantity x is said to be an observed value of a numerical 
valued random phenomenon X if for every real number x there exists a 
probability that X is less than or equal to x. In this pi'oblem the 
observed values of the random variables X and Y are the coordinates 
of the data points v/i th respect to the target. These coordinates can 
also be referred to as miss distances in and across the line of sight. 



K2.2 



The parameters and arc the mean values in the x and y 
directions respectively. The mean of a probability lav; is equivalent 
to the expected value of the random variable v;ith respect to the proba- 
bility law. This is v;ritten as: 

C,, ,,(x,y)dxdy 

.V, L 




The mean value cannot be determined exactly in our problem even if all 
of the missiles have been fired but estimates of the mean values can 
be determined from the observations. 



1 , 2.3 



The expressions (x - u ) and (y « u are the deviations from the 

X y 

mean values in the x and y directions respectively. 



1 . 2.4 

and ^ are the standard deviations in the x and y directions 
respectively. The standard deviation is defined as the square root of 
the variance of the probability law. The variance V" is defined as the 
second central moment of the probability lav; and is defined by: 



( 1 . 4 ) 





u 



2 



X 



It should be noted that the mean values deterr.iine the location (u .u ) 

X y 

of the center of the normal density function and the standard devia- 
tions and ^ ) determine the shape of the function about the mean 
in the x and y directions respectively. 
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1 . 2.5 



The correlation coefficient of tvro jointly distributed random 
variables X and Y is defined byp= COV (X,Y) uhcrc 



The correlation coefficient provides a measure of hoi7 good a predic- 
tion can be formed on one of the random variables on the basis of the 
observed value of the other random variable. In other vrords, if the 
value of one of the random variables is given, the expected value of 
the other random variable can be determined. This may be vzrittcn as 
E(X)Y) where the value of Y is given. That is, 



is the conditional density function of the random variable X given 
the value of the random variable Y, The conditional density function 
is derived from the conditional probability of a random event A, 
given a 'random variable X, This notion forms the basis of the mathe- 
matical treatment of jointly distributed random variables that arc 
not independent.^ 

In the particular case vjhcrc two random variables X and Y are 
jointly normally distributed, the conditional expected value of the 
raiidom variable X given that the random variable Y is some particular 
value y, is a linear function. This linear function is related to 
the orientation of the shape of the density function as sliovm in 
Appendix A, 

^ "!Iodcrn Probability Theory and Its Applications" by Emanuel Parzen 
/!/ of Stanford Jniversity. 



( 1 . 5 ) 



COV (X,Y) = E(X Y) - E(X)E(Y) 







( 1 . 0 ) 
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In order Lo sinplily the notation^ it will be convenient to 
represent the bivariate density function in iriatrin notation. The 
terms in formula 1.1 are first arranged in the form 



-2/1 1 -e^ ^ ^ 












U O^p-i 2'/?z 



)/A 


__r 


V 

1 




I -r 


1 




7-yV' 






;7here 






■ 1_ 




PR V7 



Using this notation, uc are nov7 ready to look at several r.iodels 
investigating the CEP and confidence interval of the CEP. 



1,3 '-‘he Dasic Problem in Estimating the CEP. 

The problem of estimating the CEP is essentially that of finding 
the radius of a circle with center at (’-'j.s'-'y) such that the probability 
is ,5 that a random point (E,Y) uill lie inside this circle, -'his may 
be expressed as 



) i’ 2+(v-u^) ° y)c;xc;y 

< I" 



u’nerc fy •■(Mjy) 
is given ’6y 
f o rmu la (1.1), 
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f 






♦ 










■TW 



I 



In ordcx' to inti'ocluce the problem, the assumptions will be made 

that the mean values arc zero (u =u =0) , that the errors in the x and 

X y 

directions are independent ( f' =0), and that the standard deviations 
are equal (^'x = Vy = V*), The probability statement is thus simpli- 
fied to 



y 



(1.0) 'L^< - 1 If exp - f (x^ + v^) l dxdy = .5 

2 7r'V'^ r . L -■ 



5s V* 



frt Y^<r 

In order to perform the integration let f 



.an-i = 



1 = s.sinS, X " Xcos3. 
Then 

^<r 



r(?sj:r) “// f £(rcos8) (rsin9|j J I drdS whore J -idx/dr dx/d3/= -r 

' /dy/^r dy/c'3( 



(1.10) T (X^r 



^ — 






,.r ^ 

r e:q> dr « 1 « ex 




2 



o o 



'Therefore j the C'P r - K1774V", 

.he problein o: estimating the CEP is thus one of obtaining a 
function of the n sample point.s ^ v;hich irlll 

estimate the standard deviation T. The estimators are functions of 
the observed values which are used to estimate the true values of the 
parameters. For example^ if m points from a sample arc given, the 
average or mean value is estimated by 



(Uli) ^ 

X 






• oo**9e»09 



-f X 



:n 



m 



fhe distribution of x bccores closely concentrated about tlic true 
value u,. as :n becomes large® 
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Tliero are many v?ays to estimate the parameters under investigation 

and it is therefore necessary to specify certain properties wliich are 

desired in estimators, I’or example, tlie distribution of the estimator 

should be concentrated near the true para.ncter value. If and 

A 

different estimators of 3 ».7ith density functions and f ^ (3 ) as 

shorm in figure (9), then '$'9 is a better estimator of 5 than^^j^. 




The Density Panctions of Tv;o Estimators 
Figure 7 

Other properties v;hich are desired in estimators are defined as follox.’c 

1.3.1 Relative Efficiency. The relative efficiency of tvra estimators 
is defined as a ratio of the mean square errors of the estima- 
tors. Tlxat is^ 

(1,12) E( 8 , - 6)^ = ?v,F. v/ncre E.F. is the ratio function. 

If n.F. < 1 , then is said to be a more efficient estimate 
of 3 than ^ 2 » 

1.3.2 dnbiased Bstimator. An estimator. ^ is said to be an unbiased 
estimate of the parameter 3 if E(S) = 0, 

1.3.3 Consistent Estimator, An estimator^ is said to be a consistent 

estimate of 3 if P(^ ->0)“>1, as n->*^ . 
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1.J.4 Zcficicnt Jlstiniator. The estimators which have the smallest 
lirrdtins variances are called cEficient estimators oT S. 

The estimators which v;ill bo used in the first part of this thesis are 
s'.iowii in Table a. 




A more detailed discussion of certain estimators under special 
assumptions is presented in Appendix B. 

1.4 The Troblem of JcpcnJcncc 

In the sijnncry •'problem, J;hc errors introduced in the line of sight 
are due to variations in the range and projectile initial velocity. The 
error across the line of sight is due to bearing errors. Since bearing 
errors and range errors arc independent of each other due to the fact 
that they are obtained from different sources, the mathematical assump- 
tion is generally made that these errors are also statistically indepen- 
dent. llotjcver, if v;c broaden the perspective to look at the major errors 
introduced In a missile trajectory, the major errors in the line of 
sight and across the line of sight are probably not independent of 
oacli other. 

This is primarily due to factors which did not especially influence 
the gunnery fire control problem such as errors in ship's navigational 



11 



posiwion^ ,..rrors introdacec' by nnsnilc att:itudG during the time o:. 



pov 7 ered especially at cratoff^ and weather conc:itions over the 

target , 

In the .gunn.ex'y problci.i there are two types of navigational problems 
fhe first is the relative problem o! firing from a i.x)ving object to 
another moving target where the fire control problem is one of obtain* 
ing relative bearings, ranges, courses, and speedSo But the firing 
ship’s true navigational position relative to the target is not an inilu 
encing factor'e 

The second problem is one of shore bombardvaent wh.ere the ship’s 
navigational position is determined by visual fix* This is closely 
related to missile launching except that the first shot in shore bombarc 
ment does not have to hit the target because the shore observer can tell 
th.e ship V7hat spots to apply to the generating fire control solution, 
fhcrefore, this again becomes a relative fire control problem \v\ierc 
cri-ors introduced by the ship’s and target’s relative positions are 
corrected by spotting. This is not practical in long range missile 
launching because of the inability to obtain corrected visual naviga- 
tional positions relative to the target due to lack of observers at th . 
target area. IJhat is done instead is that the probable errors must ol 
predetermined and enough missiles launched to give a high probability 
of destruction of the target complex. If we assume th:at th.e launching 
s!iip is determined to be at the launch reference point then the errors 
introduced arc as shovrn in figure (3'. 
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A-'/ 







3yrg = firing bearing from 
ship to position target 
\7ill have at detonation. 



True Target Bearing Diagra.ii 
Figure 8 

3yrg is proportiotxal to f Lxong ? 

I Lyong J 

Byrg ' is proportional to C Lxong + Lxong t 

( Lyong + Lyong i 

Cince Byrg differs fx'on Byrg' by the errors introduced in and 
across the line of sight, the errors are reflected in the q('Jgm)s inter- 
polation co-mputer as errors in velocity to be gained, u'.iich have not been 
entered. But the errors introduced arc not independent because the in- 
puts influence changes in velocity to be gained in both range and cross 
range directions as shoim beloxj; 
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l^lo\7 Diagra.n ol the Change la Velocity to b3 Gained 

Figure 9 

In the gunnery problem^ the \;eather conditions over the firing 
sUip^s position are the same as the weather conditions over the target^ 
therefore these values can be accurately estimated, hue missile firing 
ship depends upon intelligence and v;eather forecasts to predict the 
inputs for target weatlier conditions. This infoia.iation is therefore not 
as accurate as in the gunnery problem. wCince the errors introduced by 
v/eather predictions influence the missile trajectory over the target, 
the re-entry body is rrjost likely to be tnoved in any direction and the 
probability that the errors in and across the line of sight are inde- 
pendent of each other is low. 

The errors introduced by missile attitude during cutoff can best 
be illustrated by a vector diagram. 
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Vcctior Diagram qz Velocities at Cutoff 
Figure 10 

fhe uussilc attitude at cutoff can be regarded as a ran^oin variable 
because it can assutie any attitude due to tbe fact that the rec[uirc...unts 
to initiate cutoff are due to past and present missile velocity and not 
to a predicted velocity at some ^ t after cutoff* Thus the errors intro- 
'.luced during the of cutoff V7ill influence the errors in and across 
t le line of sight in a random manner, rhcreCorc, the probability that 
iV.c, errors in and acx'oss the line of sight arc independent is again 
io\;v=rcde 

d‘. c concl.ision is tnat due to the complexity of the fire control 
proble,.:^ t' c errors in aid across the line of sight are probably not 
indeper dent , If uc appi'onch the prublc:i \:ith this assumption and find 
that '‘he increase in accuracy gained by this model is not sufficient to 
the increase in the mathematical difficult}’, do in 'eponJent 
.tod cl can used, 

1 • j Curpuiry 

Section cue har boc;'. .:.u..-d to sec up the environviit o. proble.i 

thit IS to bo analyzed, ffie bcsj.c assu.t.ptioa .nadvi is that the fall of 



shot about the target is a randora variable which obeys the bivariate 
normal probability laws. The assumption has been made that the errors 
in and across the lino of sight are not independent and one of the 
objectives of this paper is to determine the effect of this assumption 
on the C2P, 
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jr-CTICN II 



r^i c^r jith t:^s d:*N3ITY fu..ctioi! 

Ar T*:E TARGET: I'lODTL I 



2 A Introduction 

The n'iost ir^.portant assumption made in this model is that and u 

X y 

are zero* This means tliat the center of the bivariate density function 
is at the targets Although this is the desired condition, it may not be 
true initially due to the complexities of the fire control problem. One 
of the determinations that is made from the analysis of the firing data 
is whether a correction should be made to the fire control solution to 
bring the distribution of the fall of shot over the target. Therefore, 
b> starting with the assumption tliat the center of the distribution is 
at the target and finding that this assumption is v;rong, it becomes 
necessary to determine and apply the correction to the fire control solu- 
tion. Also, it should be noted that although this assumption may not be 
true initially, it still may be true after correcting the initial fire 
control solution. 

If the center of the distribution is close to the target, (0,0) in 
the coordinate system, or suspected of being so by analysis of the test 
data, the estimators determined from this model may be better estimators 
than the estimators used in !‘odel II in Section III- \ comparison can be 
made between model I and model II, using the criterion of relative effi® 
c^c"icy to determine V7hich model is theoretically the best. This criter- 
ion is c::piained in Section III. 

In this model the errors in the x and y direction are assjir.eA to 
he nv)n«indei>end^'"t ^nd distributed in accordance with the bivariate 
irobabilUy riie. prw>bai)i i i ty that a random point (/.,Y) will lie 
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’7ithin a circle of rarlius is equal to 

► i.QX 



(2,1) r(k,T,Vv-)=’P(l^?+Y^kVl„.,)= n r., v.(><»y)'ixdy= ff 1 exp -i^'A<;,dX(.y 

xf max yj .,1c 



jl X^+Y^ <f kT max 



( 'C^+Y^ < kV" max 



where Z and A are defined in (1.7), 

In order to integrate over this form, it is necessary to first make 
a transformation to an orthogonal density function. The reason for this 
is that due to the assumption of non- independence this density 

function is oriented along non-orthogonal lines called the expected value 
of X given Y and tlie expected value of Y given X or in simpler notation 
li(XIY) and S(x7X) as defined in Section 1.2.5. This orientation is 
illustrated in figures 11a and lib. 





Three Dimensional Diagram of 
the Crientation of the Sivari- 
ate Density function where 
0 < P < 1, a-)b^c « ‘;o°. 



Figure 11a 



Two Dimensional Diagram of t' e 
Bivariate Density Function 
Formed by a Plane parallel to 
the x,y Plane Cutting the 
Density Function. 

Figure 11b 
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Tuii: re r i i c ion is shoion to be valid byproving ti nt 



2 . 2 ) 



iv’bcrc, .! 






and 






(2.3) T." 




_2 i 

V> -f \Jy ~ 






a 

rhe transfoiTied density function thus becomes 



"^V 



(u , v) 




exp 



-V.J’ A*.. 



=3 



V. *: • ^ > 



_JL 

2 T WiTi/ 




Jhe rcoi‘'ienl:Gd axes are now as shom in figures 12a and 12b, 







F7//^0 




1 



( i Ou</S 



Three Dii.easioaal Diagram of 
tlie .xeorjcnted /\xes of tne 
DivariaCt. ..ens^ily f\-nctio;u 



Tigure 12a 



Two Di mensional Diagram of the 
Teoriontod Di variate Density 
function Tormed by a Plane 
Parallel to t'ae u.,v Plane Omitting 
the Jens icy Function. 

Figure 12b 
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-he uc.aiJLS ul 



crensfor-.mation are contained in Appendix /w5. 
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This transCorrnod density function can be handled i^ore easily 
because the terms involvini 3 the correlation coefficient have been re- 
moved. The probability that a point (U,V) in the new coordinate syste:.. 
will He within a circle v;ith center at the origin and radius k^Hs 



(2.5) 






exp -V 



v)dudv = P(k,c) « 1 

2T^r^)j _ 




%) 



z. 



dudv 



where c “ This form is simplified in Appendix C.5 to 

va 

T 

(2.6) F(k,c) 



^c 

t” 



1 - exp f-k^ rCc^+ l) + (c^ - l)cos^j? 
(c''- + l) + (c^ - l)cos0 ' 



d0 



The valuer, of (k) 

tables one and tw 

c = ^ in order to 
VIi 



o 

for various values of P(!;,c) and ic) are tabulated in 
. Table one is used by entering the table with 
find k. This table can only be used for P(k,c)=.5. 



Table tvra is used by entcrin 
P(k,c) in order to find k. 
of P(k,c). 



g the table with c «= 5: the probability 

This table can be used for various values 



2,2 Estimating the CEP using ’.ddel I 

The first step is to find estimators forV*x,Vy and from the n 

observed points done by computing 

the sam.plc variances the sample covariance Vxy, and the sauiple 



correlation ^coeff icient \ are defined as follows; 

Si* - = iiy -- 2 v.' 






? ^ 

^ a ^ a ^ 

In these formulas, V ' Ys ^xy are unbiased estimates of Vx,'d~y, 

and 'T <y , 



20 



The transformed estimates of the variances are computed nent. 






^ 2 , 7 ) ^(J^^ 






T/ fTy"- ^ V ^ 

3. 

/An /A ^ T v*^ 

V-y" f . 



)le tv7o is entered with P(k,c) = .5 and c *= to find k. 

^ .A 



The estimate of the CEP = CEP 



1 



k 



2.3 Estimating the CEP using the Assumption that the Errors in the 
X and y Directions are Independent. 

If it has bean assumed that the errors in the x and y directions 
are independent^ an estimate of the CEP can be obtained by using the 
estimators in model I except that the estimated variances V"x and V” y 
are used instead of the estimated transformed variances T u and V* v 



c- 



^rnin 



^ /N 

where Thnin ~ Iiin(vx,\Ty) 



P(k*,C'0 = .5 , 



Table t\JO is entered vjith P(k*,c*) and c* in order to find k*. 

Then this estimate of the CS? = CEP|’ « k* V* max. 

2.4 Information About the Problems. 

In the problems which follow, both estimates of the CEP V7ill be 
obtained in order to compare the results in the summary in Section VI. 



2 . 5 Examp 1 e P ro b 1 ems 

fhe problems, which will be used to compare methods of estimating 
the CEP, have been set up in three cases. The first case will have ten 
sample points . o . . . . . o • • (x^^^^y and is representative of the 

point in time w‘here sone initial decision may be made as to whether the 
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:•/ sao'.ild be accepted^ rejected, or that ;ore tests should be 

joiv-uctecf. r>.e second case v/ill have fifteen sample points (:Cpy^)o*.o« 

• .y ) i.'hich v;ill include the first ten sample ooints. This is 

13 15 

intonhu. to represent an intermediate point in time \;here some terminal 
ioexsion may be ma'c on the acceptance of the weapon system. the timlrc 



case v7ili consist of Lv7enty five sample points (x,,yj^) 






It shoulc^ be noted that as the number of observations increase, the 
c-stiniaLors are more likely to be closer to the true values. The actual 
distributions of the 25 points are shovni in diagrams 1,2, and 3. rite 
coordinates of the points are as follows: 
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The value of the CEP obtained using the estimators from this 
section will be compared to the estimates of the CEP from Sections III, 
IV, and estimators which are explained in Appendix B. This comparison 
^;ill extend to the problem of rejecting outliers and t!ie comparison 
will be presented in Section VI. 

Although these problems are primarily oriented at tests involving 
the more expensive 'weapon systems, such as the IR3:;, the environment 
can be extended to less expensive weapon systems whic’n will naturally 
have more sample points. Although it was intended to make the problems 
as realistic as possible, no attempt was made to utilize data from 
actural missile firings. 
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Data points and con'.putat ional results. 
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Problem I. Case II* Data points and computational results 
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i'i.cbiem Case III. Data points and 
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Problcn II, Case I. Data points and computational results. 
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Problem II^ Case III. Data points and computational results. 










X . 



11.7/ 



^xy = .-^2.r 



% -- 

^ = 9//?. 

/V 

r ~ > on 



r,,^ = /i.7r 7ir~ 






-- 



1 

1 Dependent 
Ho del 


Independent 
rjDdel — 


,?7Z 
/<= yjo 

= ^.77 


i 

UOS' 

1 



Hf:£_rarr 9 



29 



s»* 



ronlein Ills Case 1. Lata points and computational results. 
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Problem III, Case II. Data points and computational results. 
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SECl'ION III 



DETr:?.l.i:miG TiIE cep IH^EK T1:e density FE^CTIO.. is Z2U:E?.EiJ 
AT THE POINT (u ,u ) : MODEL II 

>' y 



jA Introduction 

The most important assumption made in model II is that if an 
infinite number of tests were conducted, the mean values of :: and y 
T'?ould be and u^^ respectively. This means that the center of the 
bivariate norr.:al density function is at some point (u, ,u ) with 'respect 

X y 

to the target at (0,0)* 

If enough tests have been conducted to ascertain that this density 
function is offset from the target through the utilization of the esti« 
mators, then it may be possible to enter a spot (-u^,,-u^^) to correct 
the fall of shot. 

In this model the errors in the x and y directions are assumed to 
be non-independent but are distributed in accoi'dance with the bivariate 
normal probability laws. 

The probability tViat a point (x,y) wb.ose coordinates are chosen 

at random v;ill lie ^.7ithin a circle of radius k'^nax with center at 

(u ,u ) is equal to 
X y 



( 2 , 1 ) 



/(X -u ^ IcTmax 






/T:-u )2+(y-u k Vr-.ax 



ox?(=Ji Z'A2^dy 



2TT/A""/^ 

' / 

J ,x-=u, j ^ (y 2 4 k T r.:n> 
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In c to interrcitc over this form, it is necessary to first 

translate t!:e axes before making the transformation because the density 
function is oriented along noia»orthogonal lines avjay from the center o 
x,y coordinate system* This orientation is sho'f.ni in figures 13a and 




Figure 13a 
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Function with Center at (u mu ) 
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Cutting the Density 



Figure 12b 
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fron their 



L'l'.e translation is .nnde by subtracting the leans 
respective random variable X and That is sir-ply (X - u„) and 



7'aere in this case the matrix Z non becomes Z =/X. 



X 



rhe transfor'mation is then of the same form os the one in Section Ila 



3,2 Zstimating the CZ? Using Itodel II 



The first step is to find esti.,.ators for , TT , and P from 

the n observed points (x^yp (x^.y^) ........ (x^,y^) . This is done by 

first computing the sample means x,y an’ then computing the sample 



- > 7 _ 

variances Vx ^ ^ the sample covariance and the sample correlation 









coefficient 0 as follotrs: 



n 



^ :.’i 

^ rt 






Z'.-H - ■'>' 



a-1 



rj - - 



_ 



n-1 



^ £ (X, - X) ;y - y) 

^ n-1 



y\ 



^xy 



i!ie transformed estimates of the variances arc then computed using 
formulas (2,6). .able 1 or 2 is entered with T(k,c) = ,5 and 
c - V V to find !:. The estimate of the CZI == CZr„ = k V* . 



>3 .Istirrating the C2P Using the Assumption ..’hat the Zrrors in the 
X and y directions Ai*c Independent. 

If it has been assumed that the errors in the x and y directions 



are independent.^ an esti-matc of the CZl can be obtained by using the 

estimators in rrodel II except tnat the estimated variances y andy 

X y 
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I 



i 










^K;j«M;aSt3i- j({(| 



S^Ja3' 



T 









are used instead of the estimated transformed variances and . 






Then C” = 



V' min 
^7“' ma:c 



vjherc 



= ,«// (V»,Vy) 

'W A ( V X / ^ V / 



Tabic 1 or 2 is entered with P(k*,c*) and c* in order to find k*, 

/\ 

Then the estiniate of the CEP is CEP*« k* 7* « 

z niax 

3#4 Comparison of rbdels I and II 

If IkDdel I is the true situation, then the estimator defined in 

Section II is the mst efficient estimator. If the mean is not at (0,0), 

(Ibdel II) then it still may be advantageous to use the estimate given 

for Ibdel I if (u ,u ) is not too far av;ay from the origin and if the 
X y 

sample size is small. This is because tv:o degrees of freedom are lost 
in estimating (u, ,u ). This problem is treated in Appendix B using the 

.-V y 

criterion of relative efficiency. 

3.5 Problem Set 
Problem I, Case I 
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Data Points in Proble... I, i;=10 
Diagram 13 
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Frobien 1^ Case II. 




Diagran^ 14 
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problem I, Case III. 
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Data Poii;ts in. Pro bleu II ^ K=25 
Diagram 15 
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II 



Problen II, Case I 




Diagram 16 



Problem II, Case II 
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Data Points in Problem II, U~15 
Diagram 17 
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Frcble-t; 1 1., Case III 




Problem III, Case I 
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Problcn III , Cnse II 




Diagram 20 

Problem III , Case III 
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oj:cticii IV 
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OVING TI;E 0 



lER; 



This model covers the problera of outliers and attempts to show 
some of the reasons for eliminating the outliers from consideration in 
the determination of the estimates as v;ell as several methods for 
eliminating them* • 



A*1 Introduction to the Iroblem* 

The general problem of removing outliers is related to the fact 
that it is desirable to obtain estimates of the parameters for the 
underlying bivariate density function v:hich are not biased by obser- 
vations from a distribution different from this underlying distribu- 
tion* This in turn \7ill yield nnre accurate estiniates of the CEl * 

It is necessary to safeguard the estimate of the CEP from the ill 
effect of including information in the analysis that is not due to 
variations in the population of missiles, but is caused by some other 
factors such as weather or human errors* It is also possible that 
observations which have large deviations from the other observations 
may come from different distributions due to improvement in the 
missile design* This is especially true during the missile develop- 
ment stages vjhere each succeeding missile has improved or different 
subsystem components than preceeding missiles* For example, an im- 
proved fuel may not be correctly compensated for in the ^nssile guid- 
ance and fire control computers or a new type switch may not function 
quite as initially designcdc The combination of changes may influence 
the range of th? r'issile so that it lands farther from the target than 
predicteoo If t ton is correccly .;;ade for th.c succeeding shot, 
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i 



I 



it: soo:.':s reasonable that tlie observation for the first shot should not 
be included in the determinat ion of estimates for the Cdf. 

.\lso, as improved subsystems are added to the missile, it is 
possible that the earlier missiles V7ill not have the same density 
function as the later missiles and thus have a different ChP. In this 
case, it may become necessary to include only the later developed 
missiles in the determination of the CE?. Due to the fact that the 
;nissile development V7ill be a continuing process with each missile 
slightly different than the proceeding one, it may not be easy to dis- 
tinguish bet\7cen these distributions<, This is because both distribii- 
tions \7ill have some observations close to the target and others a\?ciy 
from the target, 'fhe figures below may help to illustrate this point. 



0 



■><; 
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1 \ 
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X (IXIS 
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Observations frou: Two 
/istribu tions about the I'arget 

:c first population 
0 second population 



density Functions of 'i.h;o 
Distributions about the Target 



Figure 14a 



Figure 14b 



41 






I 



\ 

I 

I 



t siiould be noted in figure 14b that distribution I has so*;u: 
probability or. occuring in distribution II. If this probability is 
lar^Cj, it may bo extremely difficult to separate the t\:o distributions. 
In fact^ if it is desired to separate the tv;o distributions, there is 
some probability that observations belonging to the underlying distri* 
but ion under consideration will be removed along v;ith the observations 
from the distribution that is not being considered. Thus one of the 
problems in removing outliers is to keep the probability, that the 
observations removed as outliers v;hich do in fact belong to the under- 
lying distribution, as lov7 as possible. If this probability is small, 
it is possi!)le that the observations belonging to the underlying dis- 
tribution which are still removed vill have such a low probability of 
occurence that their removal ^jill still lead to a better estimate of the 
parameterso This may be especially true for s.nall sample sizes where 
one sufficiently large or small observation can totally ruin an analysis 
of the data. Therefore, in order to eliminate an arbitrary result, it 
is necessary to establish some criteria for eliminating these outlying 
observations. 



4.2 Criteria for Ilcjoction of Outliers 

naturally shots which land at long distances from the target can 
be easily identified as wild shots or outliers v;ith possible unknom-n 
errors. But as the observations move closer to the target, it becomes 
necessary to utilize some type of probabalistic consideration for the 
rejection of outlying observations. One x:ay to approach a solution to 
this problem is to set it up as a hypothesis testing problem. On the 
basis of the observations (x. .y . ) . „ . o o • o « (r. s,y ) » a test is ..:ade of 

Ji ' 1 n n 
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the iiypothecis that the observed point belongs to the underlying 

distribution. dhe test is then conducted for each (x.,yj) for i = 1,2, 

n, one at a time. The alternate hypothesis is then that the 

observed point (x.,yp does not belong to the underlying distribution 
but to some different distribution. This can be x:ritten as: 



IL: f.^ (x. ,y.) = (x, ,y,) for each i = 1, 

i 1 “ 0,^0 ^ 






II,: f., y(x ,y ) / f (x ,y.) uhere: f„ „ (x.,y.) is t 

i 1 1 -\3,-0 ^ ^ ^Sd,^0 ^ 1 



he 



true underlying distribution* 

The probability of a Type I error will be called v x;here vis the 
probability of rejecting the hypothesis that the point does 

belong to tlie underlying distribution v;hen in fact it does belong to the 
underlying distribution. This can be expressed as 

Prob ^ error] - v 

The probability of accepting the hypothesis that some point does 

belong to the underlying distribution when the point does not belong to 
the underlying distribution and is called the Probability of a Type II 
error. 

Thus the probability of the IVP^ I error nay be called the risk 
that the experimenter is v:illing to take in ranking a mistake by rejecting 
a point ss an outlier vjhich does in fact belong to the under- 

lying distribution even though the observed value does exceed some value 
specified by the criteria. Natural ly, it Is desirable to try to keep v 
small but if v is too small then the Type II er*ror will increase and all 
outliers will be included in the determdriacion of the parameters. 
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A. 3 i.'cthod I For the Rejection of Outliers 




that a ranc’on point (X,Y) will lie uithin the ellipse Z*AZ = k. Z'AZ is 



Ceor.etrically it is the probability that the point (X,Y) will lie 



/ i 

/ 

Offset ellipse ::ade by a Plane Parallel to x,y 
Cuttinj tlie Density Function 

Figure 15 

Due to the orientation of this density function, it is necessary to 
make the transformation to the orthogonal u,v coordinate system in order 
to integrate over this for.n^ This transforiv.ation is made in the same 



the matrix notation for the quadratic form of the dependent bivariate 



normally distributed random variables h and Y. That is 




\ 2i 



and 



k is defined by 



(A. 2) 




inside the ellipse made by a plane parallel to the x,y axes cutting the 



density function as shot.n in fieurc 15 




/ 



f 



inaimcx as in Sections II and III. The probability can now be expressed 



Cm. 3) PC.”A*'.i < k^}« ^(u,v)dudv = ^ ^ J exp (-%v;» A*’.,') dudv 

VJ‘A*v}<h “ '' 

\;here V.’ Al = 



Vh^ Vv^ 



lettins "^2 ~ ^ * (4.3) reduces to 






(4 



.4) P(T2<kJ ) =■ ^expC^l^t) dt 

T' / 



T>. ^ 



The random variable T has the Chi Squared distribution \;i th two degrees 
of freedom. The above formula is a special case of the follo^Jing result 
If 2^ are independent and normally distributed random variables with 
means u. and variances , then 

(4.5) T - 



' ■J(VJ 



The degrees of freedom mis the number of independent terms in the sum. 
The density function of T, is 

(f"/- -1'- 



(4.6) f, (t). 

‘3 (£!_,)/ ;)(.® 



t > 0 



t < 0 



Tlie areas under this density function are partially tabulated in Table 4 
The desired percenta3e of the area under this curve is found by entering 
Table 4 V7ith 1 - v and the degrees of freedoir* ni. 

The decision rule t-iat is used for tlie elimination of outliers is 
to state that an observation is an outlier v^licn 

3 

’^lucrod .c:ion to the Thcoi^ of Statistics*’ by hood /2/ of 

P. and Co i no r l i o n . 



Z. (*■} 



(4.7) 






< 







Vi 





4.4 Kethod II For the r.ejection Of Outliers, 

This nethod for the rejection of outliers is based on the probability 
that a random point (X,Y) V7ill lie within a circle of radius k ^ , 

Then, letting 



r = |J(x - u^) '• + (y - Uy) J, k is defined by 
(4.S) p j^[(x » u^)2 ^ (y _ 

Y" ^ax 



Geometrically, it is the probability that the random point (X,Y) will 
lie inside the circle imposed on the quadratic form made by a plane 
parallel to the x,y axes which cuts the density function as sho'^^jn in 
Figure 16, 
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Illustration ^^'iov/ing tho Circle o.T Interest 
irhich is I:r*posed on tlie Sllipse !:adc by a ilane 
Parallel to t’ae x^y Plane Cutting the Density Punction 

Figure 16 



Due to the orientation of this density function, it is also neces- 
sary to :r.ake the transforiuation to the orthogonal u,v coordinate syster. 
The geometrical areas under consideration arc slioun below in figure 17 
for this transformed density function^ 




a 






Illustration Showing the Circle of Interest 
which is Imposed on the ellipse nadc by a Plane 
Parallel to the u,v Plane Cutting the eiisity Function 



Figure 17 



It: G’:o;.ild be noted that this r.ietliod \ 7 ill reject points outside the 
circle but ir.sidc the ellipse \ 7 hich is estimated rror.i the data points. 
Therefore, unless the variances are equal, this method vill generally 
reject points farther from the target t'\an rr.cthou I, since some points 
on or near the major axis will be outside the circle as shov7n in 
figure 17. The circle is necessarily of sr.allcr diameter than tlic 
major axis of the ellipse unless the variances are equal and then the 
circle and ellipse v^ill be synonomous. This can be seen from the follow 
ing inequality: 



W-9) 



-.2 






where = r.iaxOTTd, \ 7 ^’ 2 ) 



The probability that th.c point (U,V) in the transformed coorc inate 
system will lie within the circle/l’^ is expressed as 

(4.10) P|^J*l]2+y2^ J == yCti^v) dudv = P(!:,c) \;here c= 

kVu ^ 



This fornujila is the sane general formula that was used for the determin- 
ation of the C!:? except that .5 has nov; been replaced by (l-\7) in the 
range from .5 1. The decision rule that is used for the elimination 

of outliers is to state that an observation is an outlier when 



(4.11) W!A*h. - Z!AZ.>k^ v?here is obtained fi'ora table 2 by enterin 
^ 1 1 1 2 ^ 



\:ith 1- and the value c= ^v « It should be noted that this value 

u 

of k defines tl:e radius of the circle centered at (u ,u ) which includes 

X y 

(l»y)100y» of the bivariate probability mass. The value of k obtained 
from method I defines the ellipse v;hich includes (1*\/)100T', of the 



bivariate probability mass. 
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4,5 iroceclire for .ic:Towinz Outliers Jsinn^ ..ethod I or ..ethod II, 

; '!C nct.^al procedure to rerrove the outliers differs from the 'is~ 

cussion in feclions 4*3 and 4.4 in that the probability l-\/is only 

exact if t!\e true val.ies of the parameters u ,u and are 

X y X ’ V y X , y 

used. It should be noted that both procedures substitute estimates of 

t’lose parar-eters for the true values and therefore tiie probability of 

'hype I error is not exactly equal to V . the first step is to find 

Gstiiuators for and(^ fror.i the n observed points (x ^y )oo« 

:: y X y 11 

. . • (x .y ). this can be done using either model I or model II from 
n. n 

lections II and III respectively. !’he model used depends on vdiich basic 
ass’.rption is made about the true values of the means If it 

is assumed that u^.~u then model I can be used. If it is assji:ec 

y 

that ^0, and/or then model II ca:i be used. Also^ the criterion 

of relative efficiency can be used to determine ".:hcther model I or 
.odd II should be usqd. The estimates of the parameters 

— 5 2 P TT ^ ^T” ^ 

I y > V, v^, then computed by using the selected mode]. 

The estimated value of the matrix A is coi.^puted next using the above 



estimates. 



(4.12) 





formally the value is predetermined by 
outlier rejected on. the basis of this value. 



the experimenter and the 
It is advisable to delete 



outliers one at 
region prescribed 
duo to tiic fact tha 
the data points and 



a time until all of the data points are inside the 
by the proba:;ility l-/and the method used. This is 
t the estimated shap<^* of the c rvo is dependent 'upon 
each ddete'- point mill produce some change in the 



49 



cstx:natcy' shape o tlv: «,_asil:y ^.'unction, Jhc : is rM-'-.oVv.* 

hy iavestisating points farthest fron: the cscir.atec' .i-an val'jc 
and the point is deleted uhose estiiv.ated quadratic fcrni 

i.G greater tiian trethod .1) or (for i.isthod II). If there 

are t;:o or iroro points \:hich satisfy this requirement, the point is 
deleted Iirst vhich !ias the greatest valued quadratic form. 

It is then necessary to recompute the estimators and use the above 
procedures again^ thus removing outliers one at a tirae, until there are 
no points left v;ith estimated quadratic forms greater than (i~lor2) . 
The fitiai estimate of the CZV is then determined fror. the estimators 



derived using the data from the remaining observations^ fhis estimate 
of the CEP v/ill oe referred to as CEP^ . i7here the subscriot i refers 

2i 

to the number of data points removed. 



4.6 Infomiation About the Problems. 

In order to illustrate the above methods, the sample problems given 
in lection 2.5 ;:cre used. :bdel II x;as chosen arbitrarily for csti.:.a-* 
ting the parameters for illustrative purposes. Both methods cf reject- 
ing outliers were set up for each problem case but instead of rejecting 
outliers \7ith any specific probability^ the tables x;ere sot up to shou 
the probability that a specific data point could be rejected. This was 
done in order to compare the two methodSo 
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iroble" I, Case I. Ceca points and computational results. 

’ ‘ I 

(! 



I 






ii'h 



e 



P 






Si:cp 1* In 01 ‘der to red^ice co..*pot.:-« 

tion, it is only necessary to find tec 

naxjniuir values of Z*^li in each of the 

i i 

steps in removing the outliers* 

/\ 

-l-'-V 



Z^qAZ^,^“ 5.27, Z'AZ^==3.32, Z;f'._=3.31 









i 1» V 

.. 0 
. :5 
.?75 



i.ethod 1 



4*fl 



0. - . 



: Gthoc 
c< 



5.:3 



Data loints in Problem I, .'=10 
"iagram 22 

Conclusion: for point 10 is greatest and can be removed uith 

''0% probability by Method 1 and 955^ probability by Method. 2* 



he rccov.iputed estimators, after deleting point 10 are then 

A \ ^ 

~ 2 # 3o , ■ 



.% y - 1.2,^= 2.;4, ^y= 2.3S, - .41, P = .06, V^, = 2.4 



iependent 
. -odel 



c = .31 



k = i.OG 

CEP^j-a.i 



Independent 
. odel 

c* = .'1 

k* = 1.06 i 






Step 2o The procedure should no".’ be continued \:itZ the 9 reT.aining data 
points to determine if any of the. rer.iaining data points can ’-e removed 
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prol:ability 



\;ith .,1 S;’c ci 1 ' i 0.' [>roi a’-'i H !;y c:,'' 00. 1 -..cthou 1 or 3, 

usin;’ uji-’icf:! 2. la pro’cU';.. Clu ro arc no norn oucliors. 

rroulci.. I, 0.-i;5o II, ,:aia points an.I computational results. 



© 




littp 1. 



=» 5.04 



'10-“-io 



1- V 



. o 



1 r 

♦ / -^ 



' 15 ' 15 



" 7,74 



Metl'.od 1 

r 

* -■ ■ 
7.30 
..21 



I-ctnod 2 

•1.3 

3.10 

IJO 



23 



Conclusion t ;'or data point 15 is arcntcr.t and can be re...ovcd 

'.;itl'. ,7.5/; probability by Method 1 and Method 2. 

Tlie I'cco.TipuLcd cstii..ators after dclctinp; point 15 ai'c then 
X 1.2, y w 1.', 2.32, 3,.Vi, '^y « 3.45, ” 3.30, 2.2C, 

r =- .303 



.( i>cndcnt 


Indcpcnt'cnt 


. .odd 


. odd 


c « . '32 


c*- .7''^2 


1: .3 54 




CEP2i«3.41 


-■Ob* 
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Ctcp 2. -;ic procedure io now continued v;ith the 14 renaininp; data 
points to deterninc if any of the rer;:aining points can be reraved i7itb 



a specified probability of '5T1. 



’ A.: = 5.53 

10 10 


1-V 




Method 1 ' 


. ethod 2 


1 


i .25 1 




i; 


4.41 




: • . 


L - 


•7 0 «') 

. • Jo 


5.54 



Conclusion: >Z for data point 10 is tlic largest and can be removed 

with 057, probability by method 2 but would not be 

reraoved as an outlier by method 1* dor purposes of illustration, t*iis 
data point \7ill bo removed. T’lC rocor..puted estimators after removing 
point 13 arc 



1,7, 1^=2.47,'^= 2.71, . 1, 4,= 2.70,^= 2.37, i = .136 



, Jopendent i Inclenoudcnt ^ 

;.ocIel Ibdol I 

c = . 35 j C"= /I I 

I I< =1.09 I !;■•'•= j',/^ 

c::r.,.,«3.04 02 : 1 b,>'/ 

£2 22 

Otep 3. 'rlie procedure is a^ain continued vjith t'lc. 13 rer.'.ainin^, data 
points to determine i' any of the rc. aiuinj points can be re..»vcd wit> 
a specified probability of 95T'. In tliis cxar.iplc there are no more outlie 
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pi'obler II, Case I, ,'ata points and cornpatational results 




-6 -V o a V 



Oatn Points in ri'oblc" II, fl=10 
Diaernn 24 



Ctep 1. 


_1 


6. IS 




1-V 


.■ ctl’iod 


1 


. ethod 


, .95 


5. >9 




- 


1.975 


7.33 




5.22 


.99 






6.85 



Conclusion; Z'AC Tor point 1 is greatest and can be removed \;itu 
?5'I probability by nothod I and probability by rcet’iod 2. 



?be recov’putod csti..ators after deleting point 1 arc 

y\ ^ 



y\ 

% 



'»5, y - 2.ir,vy- 1.32,^^- .79, f- .201, 

’ 1,72 






Dependent 

.-odel 


Independent: j 
Model 1 


c « .770 


C*- 


U -1.039 


1?*“ /. o?3 


: '21 





Itep 2, Tlic pi’occdurc, using tlic I remaining points, docs not reject 



any pore data points in this px'oble. , 
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,Ci.c and coiiiputat ional results* 



i;tep 1. 



A 

f ‘ 



c; 



xax/> 



0 c>>- 



! P 



\0 



.0 



^ V 






'V 



I 



il-V/ 

.■>5 

.075 

.900 

.995 



.'.'ctViocl 1 

Kf 

5.99 

7 . 3'-' 



Gtlio;.i 2 
,.2 
‘'j. 



7.07 



~i> -V -3. IQ * ^ 

;nta loints in Probler.-! II, l.'=15 
9iaprar.-i 25 



Ccnclunion: for point; 1 is Greatest; anc' can be rev.ioved with 

95Ti probability by method 1 and 99/o probability by method 2. 



fhe re computed estimators after deleting point 1 are 
• = -.5, y = -.7,^- 2.16,^- 2.53, -.2, f ” -.03, 1^- 2,1 



DcpenJciit 
I :oucl 



k -1*076 

Cl\ «2*9^0 
J Jl 



j luclcpcndent ' 
j I-bdcl 

I c*-.s-6r 

i 



c:i -- 

21 



step 2. 'ho i rocedure usinp, the lA rct.nininf' points does not reject 
any :.orc data point;;. 
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5.1 Introduction 

riic previously introduced ostir.iates of the CdP are all called 
point estimates where the estimate of the C2P was defined by the locus 
of a point moving at a constant distance (the radius) from a fixed 
point (called the mean or(u.^,u^ )j. This constant distaiice or radius 
is called the CEP. The confidence interval of the CE? attempts to 
give some measure of the possible error in the estimate of the CEP. 

Tlie confidence is defined as the probability that the true value of 
the CEP lies in an interval between Lj^ and where and are 
functions of the random observations (iipVj) , i “ 1,2, n. f’lis ex- 

pression in probability notation is 



(5.1) PfL, (X ,..X ,Y ,...Y ) -iCSP 4I..(X ...X ,Y ...Y )1 ~ \ - c< 

ci-i ni .1 Cl n*- 

This interval estimate is a function of the confidence required, the 
number of observations, and the estimate of the standard deviation used, 



5.2 Obtaining the Interval Estimate 

In ox'der to avoid lengthy computation in obtaining the interval 
estimate, it is assumed that tlie variances are equal. That is 

The CEP ’.:as defined in Eoctlon 1.3 as being equal t;o k 7" \7hcre t''.r 
value h is a function of the ratio of the variances and the probability 
that the mean centered circle contains 5f'% of the bivariate density mass, 

fee Eect ion 2.3 
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Since the vai’iances are assumed to bo equal, the ratio or the variances 
is 1, and r(k,i) “ .5, so that k = 1.177A (fror.i Table 1 x- 7 ith c = !)» 
.\lth 0 u 3 h the variances arc assumed to be equal, th.o. estimates or the 
variances are not necessarily equal. 

The estimate qS the standard deviation v;ill be deteminsd by the follow- 
ing two methods. 



C O 1 
J f 4 • 1 



Oetermlnins 
In this method 



the Oenfidence Interval, 
ma;{\T2, 



Method 1 
be selected to 



represent 




That is 

(5.2) « 



n -1 L n -1 




0 

lfV£ is 
n- 1 , this 

(5,3) 



divided by the true value of the parameter and multiplied by 
formula becomes 



Although the sum in (3,3) vjill not be an exact chi squared random vari- 
able because it is the maximum of tw chi squared random variables, an 
approximate confidence Interval can bo obtained by treating (5.3) as 
though It were a chi squared random variable,^ 

'i’liO confidence interval defined by (5,1) thus becomes 

(5.4) l-'x. « v( cy ^ < X/\/-y j - 




v'jee Section 4.3, 
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.'he values of A A , obtained by entering 

table 4 v;ith n-1 and either l»o<'2 or ot'2 respectively . 

542.2 Jelerr.uning the Confidence Interval^ I'ethod 2, 

The cntiruaLe of the variance in this method is the avera.^c of the 



tvx) estimates. '.That is 



<5.3, -- jrf I (£^ . i 

^ A A/-/ A/-I J 

If (5.5) is divided by the true value of th.e parameter and multipl ied 

by 2(n-l), the formula becomes 

3 5) r lir/v-xr <- Kh-y)''] 

d. 



C 



^./here 2 m and are normally and independen tly distributed and x and y 
are the sample means. Tb.is formula can be reduced by letting; the values 
of i range fromi 1 to n and the values of j range from n + 1 to 2n. Then 
the forc'iula becomes 



(5.7) . 0 V/ = S ~ ^ 



K'-\ 



n’h.ere 2 ,^ = n, for I< » and z. - y for !c “ n •^ 1 ,.2n an-’ 

iC I< k 

there arc 2(n^l) squares in the sum. Thus (5.7) has a jl distribution 
’v.dtli 2(n-l) degr^^es of frecdon by tho: definition given in (4.5)^ The 
interval estimate is deterained in the sane \;ay as in (3.4) and the 



formvila beco: ec 






(5.2) ?( \<[7 7V V z‘^3,(^J-i) y U 71 j Tz> } 

f^X~ =— ^ ^ •-==-“ 



i - w: 
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:.c vol >s obtainca by entering 

^v^blc 4 \7U:'^ 2(n-l) and oiV^er l-oC'2 or X/2 respectively. It sliould 

be noted tliat this nethoc o2 interval estimation is not as conserva~ 

tivo as method 1 because the avcra 3 e value is al\:ays less than the 
1 :i\ 



maxinum of Therefore , this interval estimate \vnll bo smaller. 



5,3 Illustration 

The estimates of the confidence interval of the CEP used in the 
folloxji ;3 illustrations T-7crc obtained \7ith the data from Section 3 
and •'^5. A comparison is male betu’cen method 1 and method 2 

as veil as a variation of tlic t\x> methods uh'^re the dc^ ent istiraate 

of t:ve v>di s ddlb, = h v^) v’as substituted for 1,1774 ^ 2 ^, It should be 

c;..phasisGv. tint none of the distribution theory used in I.ctho(' 2 holds 

^ s ^ 

..i;en is used for k Vi^ . dl.erctorGj it is hard tp get a i.iatbc- 

matically m.caningfal comparison betveen these methods, 

fable b shovs the various estimates of the CEP, The best esti: ate 

of the CdP is liiost likely to be due to the basic assumptions of 

/A 

dependence and unequal variances, Tae estimate of 1,1774 is the 

max 

largest estimate of the CdP and therefore the most conservative estir.ate 



of the CCr. 
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It is noted that the lovjor bound estiv.'.ates are for all practical purposes 
the sa’..';e for both, r.cthods, with the averase difference being only .01. 
H-ov.-ever j the upper bound differences show that n'.ethod 1 gives a greater 
cstiiT-ate with the average difference being 1.50. The lengths of the 
confidence intervals are compared in Table o belo\.n 



Table e 
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It should be noted t,\at the confidence interval becomes smaller as the 
number of observations increase. I’his ir.iplies that the true value of 
the CfP is rore likely to be •L 7 ithin a smaller interval as the number of 
observations increase • 

fviagrams 28,29 and 30 shov? the confidence interval using the 
different estimates. The confidence intervals \:ere obtained by using 
the data from case III of each of the problems. 



5.4 Conclusions 



fcthoc 1 , using 1.1774^,,,^,, produces the largest estimates and 
therefore is the r.iost conservative estia-.ate of the confidence interval. 
fou'Gvcr^ C.CI 9 and 1.1774'^.^ are likely to be better estiraates of the 
CC? and therefore raetUod 2 or tlic approxi:;ate interval using the depen- 



dent estimate CZF 2 may be t'ue best method for esti.mating the confidence 
interval, .hi analysis of actual missile data should give a more realis- 
tic insight into the best choice of methods to use in estimating the 
confidence interval. In order to come to any definite conclusions about 
the different methods, some comprehensive distribution theory problems 
must be solved. 
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1 Introduction 

The previous sections have been concerned x^ith the development of 
different types of rnodeis and methods for estimating the radius of the 
mean centered circle which includes 50% of a bivariate probability 
masse This section summarizes the different models and methods used 
in the previous sections^ and includes an analysis of the results 
obtained from problems. Although the sample problems do not represent 
actual missile test results^ an attempt has been made to make the data 
as realistic as possible. Therefore an analysis of the problems 
should show certain relationships betvzeen the models used to estimate 
the CUP that xv’ould also apply to actual missile test data. 

6.2 Comparison Cf ..'odel I di th .Yodel II. 

The basic underlying assumption made in Yodel I was that the true 
value of the mean was located at the target, (0,0) <» Therefore, the 
CEP in this model is defined as the radius of a circle around the 



target. 



The 
value of 
target. 



basic underlying assumption made in Yodel II v7as that the true 
the mean v;as located at some ooint (v .,u ) away fx"on the 

>v y ^ 

Thereforej^ the estimated CCP for this .uodel is the radius of 



a circle with center at some pointy (>--y) 
A comparison of the estimate of the 
that they change in much the sane manner 
a major difference betv«7eeri these i 

This is shovTn in Olagrr^.n.. "•! . 32^^ and C3 



correlation coefficient shows 
in both iiK)deis. As suspected, 
3 ua t’.-.e iocatxon of n and y, 
V7hich illustrate the estimates 




*S' 







of the C.:::i « 



The i'.sti*.:atc of the Cfc for ^-roblo-.s 2 and 3 is practically 
the sane in all three caser-^ ThevGCcro. v:hen ihc center of the distri'=» 
bution is near the targetj there is little practical difference between 
the two .nodelSo r.oweverj, in problem the distribution of data points 
is around so.ne point (Xs,y) away froiri the center » If the procedure 
given in Appendix 3 is used to estimate the ratio function^ then the 
values obtained indicate that CE ?2 gives the best estimate of the CEP 
for a sample size of 10 in problem Aiso^ as the sample size increase 
the ratio function increases^ thus ^^’2 the best estimate for 

n > ID. The values of R.F. obtained for problems 2 and 3 show a pre- 
ference for Model I for small sample sizes and are very close to 1 for 
large sample sizes and therefore either estimate may be used. 

These problems tend to substantiate the fact that the procedure of 
Model II is superior to the procedure of Model I in large sample sizes. 
They also suggest that if Model I is used in analyzing a small number 
of observations, it might be advantageous to check the assumption of 
mean (0,0) by computing the sample means. 
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-o3 Co ..pari son Of Tae Iiidependcnt .Viu OepenCeut ..cthods Of Zstlruating 
The CET. 

In the introducrion to the probien. of estimating the CJiPj the 
assumption nas made that the errors in the x and y directions .;ere not 
independent. This assumption is natural unless an apriori knowledge 
suggests that the errors in the .x and y directions are independent, 
'ot.’ever, the assumption of independence In the fire control problem is 
quite difficult to justify due to its complexity, Thereferej it uould 
seem wise to estimate the magnitude c£ the error involved in assuming 
independence in order to find out how much difference this assumption 
\;ili mean in the dcteri-iination of the CdPc 

It was sho;-ai in Appendix A that the true orientation of the density 
function was related to tVie correlation coefficient. If the true shape 
of the density function is oriented at some angle with respect to the 
X and y axes and independence is assumed, the computed standard devia- 
tion is not the best estimate of the standard deviation. Consequently 
the independence assumption introduces an additional error in the 
estimate of the CdP. 

Table f is used to illustrate some of the important differences 
in the results obtained from the problems using the two models. 
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Tab^ .. f 

Cor:'iputr-d ‘.1 "£ ei‘c;nces .oCtls I and II 



Irobler.i 



[ 



Ibdel I 



Model II 



Oi f ferences 



I ° ..ocl el 







hadius 


Ciff in 


Correl 




ladies 


Di f f i n 


■Cori'el J 




hadius 


Esc „ 
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of CLT 
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coef f 


O A. w .5iP 


Ct s t . 0 1 


coef t. 


f 0 < ccr 
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; ccef f . 








scanr’, 

dev. 




CEP 


stand, 

dev, 

1 < 

.i 


' P 1 


s; -‘-i 


stand i 
dGV._J 


1 o 

j p 
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( • i 

i ' 


1 1 


.45 
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Cn sc 
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3.97 


. 50 
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.475 i 


3,64 


.15 


' 1 ’ 

’ . 3S0 1 


.33 


.095 
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4,15 


.54 


,206 1 


n ">7 


.7* 


: .081 I 


.25 


•=.23 


.125 
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3,55 


,54 

1 


.204 ; 


3.25 


.32 


i .320 1 

! 

i 


.27 


.22 


-.116 


II 

Case 


1 

i> 


3.37 


o57 


1 1 

-.454 i 


3,33 


,81 


i 

- , 6 2 6 1 


,04 


-.24 


-.072 
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3.45 


. 86 


»,25o 




3.39 


1,06 


•=■.395 i 


.03 


- , 20 


-.133 
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3.77 i 


j .43 


.031 




3.71 


.62 


•>.10? ' 
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-.19 


-.076 


III 
















1 

1 








Case 
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4.32 


*5 

. O J 


.735 




i 4,66 


.69 


.625 


-.34 


-.06 


.110 
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3.72 


.57 


.695 




3.56 


.65 


, v03 I 


.05 


-.08 


-.203 




3 : 


2.40 


.35 


.660 




3.52 


.65 


.650 i 


-.12 


> -.30 
1 

J _ J 


.010 



TMe table sbo'.’s some difference in the magnitude of the radius of 
the C2P as estimated by the tvo s^odels with the maxi;.ium difference being 
•SS/Be?? or 3,3%* Also, the trend in the size remains constant between 
the two m:odels. That is, as t!ie size of the estimated CSP changes in 
one model, it changes in the other model in thci same direction^ 

Graph 1 shows a plot of the percent difference in the independent 
and dependent estimates versus t)\e correlation coefficient* It should be 
emphasized that the points on the graph were obtained from data computed 
frem the sample problem. 

The differences in the estimates of the CEP from the problems are 
shccmi in Diagrams 34, 35. and 36* Dome of the differences were so 
small chat these estimates were left offo It is interesting to note 
that the di str ib?iticn of deta i:. problem 3 :>hovs almost perfect correla- 
tion and the estimated differon^^es wer: also a ..axinum® 
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C,h :^fI;ectG Caused By The .Icnoval Of Outliers, 

The most obvious effect on the CCP when outliers are removed is 
that the CCP becomes smaller. Towever, there are several otI;er effects 
ij'.iich are not obvious but may be important in determining v;hich estima- 
tors can be used. Table g, using the sample px'oblems, gives a coimparison 
betv 7 een Method I and !!cthod II and the estimates of certain parameters 
before and after I'emoval of outliers. 



Table 3 
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Of !le.*.ioving Outliers On i 
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' before 


j After 
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j outlier 


1 outlier 
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1 

1 


ircmoved* removed 


. re;rip ved. 


[^_re.'.ioved,| 


j removed 


' removed 


I 

Case 1 
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.05 
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3.54 


3.12 


.333 


‘ 1 
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; .059 
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1 V 
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3.33 
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3.03 
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.69 
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The estimate of the CEP v;ac reduced by from 147, to 357, in the 
problems by the resioval of outliers, .If a probability of the fype I 
error had been specified as ,05, the point rejected as an outlier iii 
problem 1 vxjuld not have been rejected by the elliptical method but 
would have been rejected by Method II, This is because K'etliod I and 
Metliod II are not the same and \ 7 ill not necessarily reject the same 
points for the same confidence level. The effects of iremoving outliers 
are shpvni in Diagranis 37, 30, and 39, 
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It clsould bo noted that the rernoval of outliers may change both 
the correlation coefficients and the difference betvjeen the standard 
deviations in the x and y directions. This is due to the large effect 
that an outlier has upon the distribution parameters. Thus a large 
correlation coefficient may be due to the presence of an outlier and 
not due to correlation between the errors in the x and y directions. 
Therefore, before the independent method of estimation is rejected, an 
investigation should be made for outliers. 
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A,1 Introduction 

This appendix is concerned with the orientation of the bivariate 
normal density function over the x,y plane. Primarily this requires an 
investigation of the correlation betvjeen the random variables X and Y 
and once the correlation is determined, a transformation of axes so 
that the function can be integrated more easily, 

A, 2 Orientation of the Axes 

If the correlation coefficient is zero, that is the random variables 
X and Y arc independent, the orientation will be symmetrical vjith 
respect to the x and y axes. This means that a plane parallel to the 
X and y piano will cut the density function in the form of an ellipse 
v;ho»G minor and major axes are parallel to the x and y axes. This is 
shotm bolov? in figure A,l, Note that ifV)<“7y, the ellipse becomes a 



Orientation of tlie Ellipse Jhen P “0. 

Figure A, 1 

If the correlation coefficient is not zero and less than plus or 
minus 1, the orientation is offset from the x and y axes in the direction 
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Oriontstiou cf t’ e ZWi'pso. 
-..-hen C<P<1 
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ioi\ :. i: '\he /-ilip: 

’.'*on -1<P<10 



Figure A«2a Figure :.o2b 

ihe error intioFiuced in assuiping ini-epencence^ when the rando.:i 
variables and Y arc not indepenoenr. fu^ctiori of tuc correlation 

coefficient and is due to the true orientation of the density fuuccicn 
with respect to the x and y axes^ If it is assui.ied tliat the erirc s in 
the X and y directions ar 2 indepenaenc \7hcn in fact they are not. an 
additional erroi •■’ill be ..ntde in cor.puting the estlr.ates of the vanancis 

A ✓x 

:his is due to t •g fact t'tat the computation cf will be in the eirec 
tion of the assu.;;ed axes ;Xj,y) instead cf the direction of true orien- 
tation h('v|Y) 3 w(fi 0» Yhe«efci:e. it becorres importcviit to obtain son:; 
knowledge cf the true orientation in order to oatain the best estimates 
of the variancGCe, This can he doxie by obtaining estimates of the jnglec- 
br'tvoeri the ass.ii.'ed axes and the true axeso 
The collowing sections are devoted to the diifeient possible orien- 
tations ere to Xjo c-lfferen ' r^riges of the corio: elation coe : f n cient « 



YctexuU3“ianiL,u of the Client r-u::::! ot -Oves 



t’lc Co5:rolciri.. n Cceffici' it 



..r i.^ up Cr; : cz: 



iti. 



Jeternun ■ -n 
is zr.io» 

11 P tiie i-'ando' . variables X and 1 aie i .’ ''epe'-'-rlrnt o lizerercr.e 
Z(XiY) = Y(”) and dT' j \) -■ c Ibis caii prov^en b} Jet 2 r:rin=^cicu 

of either the value of che conditional expe^^cat ion di-. octlv or ladlieccl^ 
usins the linear preaictoi% 

A. 3. 1*1 direct Jcteriai nation of the Co'nli t ional Expectation if p 

The expected value of one randorr; variable given the value of the 



other randoiT] variable was defined in section l«2o5 as 
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The E(Y)X) can be dctemined in the sar^-e way azid is equal to hi:). 

A*3.1.2 Indirect Eeteri^iination of the Conditional Expectation bsing the 
Aest Linear Predictor, 

The conditional expectation of one random variable given the value 
of ti\e other rando.n variable is a linear function of the knovo'i random 
variable \;hen both random variables are jointly normally distributed, 
rhac is the E(’'IT) ~ Ay Tc, xAnere A and B conscants Xvhich can he 
determined, Tb.e linear predictors for the c-nd^tionxil expectations indci 
consideration ; Ji be defined as foliews:: 
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= 2(/-) + C„[Y»av/)] vn-.ere C. ^ CCV(X, Y) ^ ^ 



v'Aa(x) 



^ J-J • V ’ r < ' \ 

'fWrntrfem 



from 1,2,5 



^y' 



In the case vhere P =0 « 00V(X,Y) , C0V(X,Y) = C = C = C from (A, 2) 

VAR(X) 1 2 



Therefore, the results become the same as in Section A, 2. 1.1. That 1; 



(A.3) T(X/Y) - T(::) and 
2(Y/X) ^TCY) 






In the case that P =0, the orientation of the axes will be as shovni in 
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Orientation of the Pocc-.s iv’hen 
the Correlation Coefficient 
is Zero. 



Tisure A. 3a 



Orientation of the Density 
Tunction when the Correlation 
'Coefficient is Zero. 
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Figure P..3b 
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Ao 3^ 2 



3c terra na 1:1 on of the Crientatiori If (" ~ 1* 

If = Ic, the ZiT:) ^nc J:(i/X) mil He along the sar.v:: axis. jhis 
can be proven by asing the best linear predictors in foivuia (A, 2) and 
the definition of the correlation coefficiento fhat is 



(\.4) f = i = COV(",V) and therefore COV(.:,Y) =/7SU0y7Ad(Y) 



Then using formula (Ao2) E(Y/:0=2(Y) vC^[('l= T( 0] =E(Y)»Cj Y(:0+C^X 

\7here 



(A„5) 



'1 “ cove h’O “ f '£^('11 

varVx) V va.Ux) 



using the result in formula A»4» Since 



2(Y), and] V/M; (Y) are constants^ the random variable S(Aj:0 



the for:.: 
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(A. 5) 


2(yI;0-a;:+" 


■where 


and 


3-E(Y)-EC0 


fhe tangent of the 


angle 


between the ) 


/ axis and the 


line 


E(Yl:0 is 




(A. 7) 


ci[;:(Y/ :)3 
dx 


~ ^an9 “ 




VA:1(Y) in 

>'V "In** 


the case v;here P~ H 



The tangent of the angle between the x axis and tiie line E('riY) is 
determined in t’ae same vay and in this case 



(Ao fany 





the case tdiere P 




fan© 






the lines must be the same 



Tlio lo ^ o re 5 i n c- c-r^ s r- 



P ~ 1 the oi ^en L..?;. ion of tae axes \ti11 ' o - 




Orientation o£ the ^ensity 
Function xfnen the Correlation 
Coefficient is !• 

Figure A»4a 



Orientation of the Axes ulien 
the Correlation Coefficient 
is K 

Figure A, 4b 



A. 3,3 Dcterp.inat ion of the Orientation of the fo;cs if Oc^P^ 1« 

I: 0 < P < i, the two lines E(Fv| /) and C(V|:0 x^ill not be the same 
or perpendicular and \;ill be oriented as showTi in figures Ao5a and Ao5b, 
This can be proven by using the same method as in Section A.3<»2, except 
that 

(AA ) 0 ^ XVC^V) 1 

VAu(:<) vaH(^o 

It follo'rs from this that Od CCiV(X/i) < / VA?,(Y)o Then using 

formulas A«^7 and A«8 with the definitions of the constants in formulas 



the desired angles are 



r 



the cov.ipleto range of possible values for 



the C.UO angles using \,9 arc 



A. 11) 0< ^<Tan 
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0< Q < 




In this case the orientation of the axes be as shevTn 



in figures 




Orientation of the density 
Function i.nien 0 < f ^ 1* 



Orientation of 
the Correlation 
is 0 



the Axes when 
Coefficient 
< f < U 



Figure \o5a 



Figure A. 5b 



Determination of the Orientation of the axes if Oo 

If it follows from formulas Ao9, AolO and A.* 11 that 



(A. 12) 



I va:iU 



^ 9 < 0, 
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tan 



-1 



VA.1(:Q K 0 < 0 

\J v.\.i( 0 



In this case the orisutation of the axes will be as shova in fioures 
A. 6a axid A. 6b. 



V o 





Oricntatiion of the Density Orientation of the A:<es 

Function \7hen < P < 0 uhen -1 < ^< 0* 

Figure A. 5a Figure A, 6b 



A. 4 111 jstrations 

Mthojgh the true orientation of 
can be estimated by using the various 



the axes will not be kno™, it 
estimators shoxai in table h belov;: 



! Table h 


j Estimators Lsed in Determining Estimated Axes Orientation 


Estimator used 


Value Estimated 


i 

i 

i 

1 

i 

1 

1 


ECO 

E(-:) 

XT A 'T 
/:ij\ 

va;;Cy) 

,/va:ux) 

\/VM(Y) 

C0V(X,Y) 



The estimated parameters in the illustrations which follow are 
determined by using the data from the example problems in Section II. 

A. 4.1 Illustration (1) 

The data is obtained from example problem no. 1 x^ith a sample size 
of n “ 25. 
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Z X '2s 

T; - 1 c 2 j y = 1 » 5 5 ^ = c • 0 , = C « 6 ^ ^y = » • 4 » \ 

fO^:) - iTc^) + c^L:' - :T("o3 , sHy) = - iTcy)J 






whei*e '^y =»,05 = Tan9 

<3- 3 

v;^ 



S ~ V^y = ”.04 = Tan55 



a = I76°5i® 5? = 17703 I' 

i?he orientation oj: the axes is shovm in figure A. 7 




X AXIS 






Esti-nated Orientation 
of the /ixes 



Figure A. 7 



It should be noted that the orientation of the axes in figure A, 7 
iriiplies that the random variables X and Y arc nearly independent and 
that the independent nxidel of computing the COF can be used v;ith only 
a small error due to the orientation. The computed values for the t;;o 
different estimates of the CSF are 'CSP 2 = 3.28 and 
A, 4, 2 Illustration (2) 

The data obtained from problem 3 with a sample size of n = 15. 
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VAR(X) 

a r= 



^2 " C07 (.:.Y) - .753 = Tan0 
VAR(Y) 

3 = 37 ^:/ 



orientation of. the axes is shov-n in figure Ao'?> 




;:stin:ated Orientation of the 7a<es 
Wien Dependence is Iinplied 

jigure Ao7 



It should be noted that if this were the true orientation, it 
implies almost perfect correlation between tl-e random variables X and 
This orientation \;ill exhibit the greatest difference in the esti« 
rates of the CDP if independence vere initially assumed. The computed 
values for the t\7o different estimates of the CEP arc 3,52 and 

CT?2 = 3,72, 

A. 4, 3 Illustration (3) 

The data for this illustration is also obtained ^rom problem 3 with 
a ‘sample si^e of n ~ 15o owever^ in this case^ the -\x> outliers have 
the sample si^e used for the coiupucation is 13, 



been removed and 



-A 



A 



.„8, y = = 3e64, Ty = '^y= ,32, f' = ,07 



E(YI.O = .-(Y) + ca'^rcol 



E( -!•:) = -c c.[y-2(-;)] 



unere 



C.^ 

"0 ... r.O *3 P 






*"2 ~ Vxv = ,056 = Tanj! 
= 3'545* 



The orientation of the axes after renoval of Che outliers is shoun in 
figure A, 9 




Estinated Orientatioii of the .'u<es 
After IleTOval of the Outliers 



Figure .. 



It sliould be noted that the removal of the outliers i*otated th.e 
axes enough so that independence could be assumed with, only a small error 
in the estimate of the C2P, The computed values for the two different 
estimates of the CEP are 19^^" 2,42 and CS^‘^' “ 2,52. T..us the removal 

^ «!t, 

of outliers will not only reduce the size of the CEP but .r.ay aid in the 
deter.r.inacion of whether the simpler r.iodel of independent escimates 
na" be used or not^, 



.rans..omat4cn or tne y\>ces 
In order tc f: 



: ( 1 ,> I ) c> c c I jii 1 ^ it is ne ce s sary 
o transforr. the axc;S.> TIJ. •: errs.sfcu,;t'‘::'on can be done In several XA7ays 



ICd 



I 




'^:i 



but the ui.Q ol .i.atrix notation can f^reatly si mplify the procedure. 

It is necessary to first define soi'ae of the concepts which X7ill be used, 
.\,5* 1 Def ini tions ; 



AoSelol The r.iacrix A- (a — ) where 

j 

s inpl 1 f i cation e 



a4 



• 



V ®21 



^ 12 ’ 



22 - 



will be used for 



5,1,2 The cransposed -.riatrix is defined as A*-. 



^^11 ’'^ 2V 



1 a _ I 

^ 12 22^ 



A. 5. 1,2,1 rheorer.-; 1, The transpose of A*==(A*)*=A 

A. 5. 1,3 Th.e inverse of A is defined as the matrix A"^ such that 



t 1) 



A. 5. 1,4 The identity matrix I =^''1 Oj 

A, 5, 1.5 A sy, metric matrix is defined as a matrix such that the 
transpose of the matrix A equals A, Tliat is 

1 a<%,\ /®i, 



^11 "22\ /“ll 



12' 



A* = 



^12 ^ 22 ^ V ®21 ^ 22 ‘ 



= A 



A. 5.1,6 If C is a 2x2 matrix such that C* C = I, then C is defined 
as an orthogonal matrix and C*==C“^. 

A. 5. 1.7 characteristic root of a 2x2 matrix A is a scalar X such 
that AX- A X and .'Td- AX«0 for some vector X A 0, It follow 
that if A is a characteristic root of then (x‘w Al)X=0 
and therefore ^A-Ai|“ 



.i^'i ^ iv, define- <ic a S'l’isurc ;iai.i*ix 



’.'haac o^,. '■■lacoual cJ civicnts ara all zero. Jl.at is. 






Clio 



d.,----0 if l#j. In C'.u: 
1 : ■ ^ 



ya::-.rlc > 



mi 

lO 



0 \ 
’^22 



The quadjraclc for.n Q is defined as iZ 
A. 5, 2 Tlie bivariate nor.,ial density function in matrix uotacion i: 



? 7T /-“‘I 



1 - t 

I f 

y^Ty Ty^ 



/V er.TY\ 
i VyW 

It shouli' be noted that i\ and A“^ arc bold, synnietric inatrices, fliai; 
is A “ A' and Thus the theore.n applie.s that for every syr- 

metric iTiatrix A“^ there exists an ortiiogonal uiatrix C such tliat 
where D is a diagonal matrix whose diagonal elements are the character^ 
ietic roots of A“^. The matrix D wuld thus be 



/X, Ov 
I O Xa 



v/Iierc A, and X are the characteristic 
i"ool:s of the ;;iatrix V. 

In order to find the characteristic roots of A“^ t.’c must first use the 
idsntitv ;ri<atrix I 



Y1 Oy, then iX-p OjA = 0^ 



ZHe characteristic roots of a synmetric matrix are determined fro’u the 
cliaracteristi c polynomial CfA)«/A“'*» A l/ *0 
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•:!ua j atiC eq>aatiuii 
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L'he transToi -e.’ ■!o;-'sit> -ancticn is 

C.'.n) q ( ~ 



Jiiv' ' ' a/ 7 '/^** 7 >'» 






'Ja hot; have a nor.ral bivariate .'ni.,sit/ function ;ith indepenr.leuc laa’- 
variables t and V such teat the ^aatiix ’.; ^/V\is distributed h’ ^ / 



and "J ~ C* Z xdjere C is the ottho2,ouril leacrAX define ' v.h’cu sat'sfies 
the relations' ip C‘A“"C - A*'""® Ab3r':jf:or-" f- i also c'j.striA .ted. 
d vO <) A- " ) and the criAinai terins invclv-ni^ the. cccrel'ition constant '.j 
not appear in the quadiatlc n o - for this ca stributioua Ic 



/ 



should also be noted i.»iat A s'- 

O 






/^l '?he orientation of the 



ran vaiia-.lco U w..: V an* r* to r r i s '.ouTi n iiAure -a®!^ 



3 



X 




iillipsc For;aed by Cutting the 31 variate 
Isoriual Density Function^^y(u , v) by a Plane 
Parallel to the u,v Axes. 

F i gu re A • 1 

Fortunately it is not necessary to compute the orthogonal ...atrix 
uhich satisfies the relationships above since the characteristics of t 
orthogonal matrix C requires that C* C = J. and C’ " C"^. Then 

(A. 14) (VJ 'A*(’.; = (C'Z 'A*(C»Z =(Z • CA*C» (2 =(Z »A(Z 

but C*A"^C=A*-"1 

therefore (C' A”^ C) " * = (A*"^) "^=A* 

= C“^AC=A* 

CC"^ACC"^=CA*C"^ 

A =CA*C“' 

Therefore, it can also be shou’n tliat the corresponding areas under the 
density functions are equals That is 

( A. 1 5 ) ^ (u , v) du dv“ ^ (x , y ) dxdy 

where: 9,. ,.(’-',v) = 1 exc> 

‘ 

f- ,-0<:.y) " 1 exD -hZ'AZ 

21 |A“ifT 
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This is because ’J*A*v = Z*AZ as sho'ai above and / =j A"^| , It is 

sliovm above that C“iA"’'‘'C = A*'”^ and the detei'minates of the tv?o terms 

are /C“’ AC'- /A*"^/ = /C! = |A*“^/ ={A-^( I ci=/A"^| =/A*"^/, 

I Cl 



Therefore^^ ,,(u^v)dudy=jj'f^^ ,,(-,y)dxd^ 
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B,1 Introduction 

The theory of estimation is concerned with the problem of finding 
functions of the observations such that the distribution of these 
functions will be concentrated as closely as possible near the true 
values of the parameters estimateco The density function of the obser- 
vations under consideration was described in Section I and the para® 
meters v?hich are to be estimated are u and P « 

Some of the properties which are desired of the estimators were 

described in Section l*3o 

B*2 Kaximum Likelihood Estimation 



density function for a random sample of size n x^ith uuknoiTn parameters 



Since it is more convenient to deal with sums than products^ it 
is easier to maximize the logarithm of the likelihood function rather 
thvnn the likelihood function itself • It should be noted that the 



If fCXpX^ 





C ) IS the 




logarithin has its rr.axiraura at the same point; as does the likelihood 



function. The log of (B,l) is 




i06 



The n'.nx 1 .1111171 likelihood ectiniaCe of each of the unknovm parameters 
is obtained by setting the derivative of the function X 7 ith respect to 
each of the unknotai parameters equal to zero and then solving the 
resulting equations simultaneously. To illustrate this procedures, the 

^ "iL 

assumptions will be made that andf^= 0. For this special 

case, formula (S.2) becomes 



N 



(B,3) L' = -F log 27T -N log^° 
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-h (Y^-Uy f 



and the partial derivatives are 
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(YyC ^y) 
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- I(K- - V 
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-NUy , 



Zquating the partial derivatives to zero and solving simultaneously 
it follov7s that, 







Since maximum likelihood estimators are in general biased esti- 
matorsj it is necessary to examine them to see ijhether they are unbiased 
For example*, if the expected value of the estimator ^5ls equal to © 
where @ Is the true pr/ram.etcrr then 



i07 



(B.7) 



and 



in an unbiased estixator 



Eff) 



Q 



6, 



Since F(^i) " E-iVt) ~ '“J i “ l...,.»,n, it follovjt 

»— • IX 1 y 

that "x and y arc unbiased estimators for and u^ respectively. 

The expected value of the estimator in formula (13,6) is obtained 
by recognizing the fact that there are 2(n-l) independent squares in 
the sum and therefore 2n is a chi squared random variable with 

2(n-l) degrees of freedora as defined in formula (4,5), Since the ex» 
pectcd value of a chi squared random variable is equal to its degrees 
of fi*eedom, it follows Chat 



(B.8) E ) “2(n-l). 



Therefore, 



-BX 



and 



JL . 7 hx^-x; 



an 



unbiased estimator of V when the variances arc equal, ./hen the vari« 
anccs ai*e not equal, the same procedure may be used and the unbiased 
estimators of '^^and Vy^are 



(B.IO) 



V 



-A 



V 






‘ . 2 

X~-\ A/-/ 

. TLlizll 

a/~/ 



It .should be noted that the estimators (BolO) are used in /bdel II. 
Al.so, if the assumption is made that the true values of the means are 
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zei'o, then the estimators for the varimces in kodel I are also unbiased. 
The estimators of the means and variance used in ik>del I and Model II 
aro» apart from the biasing factors, maximum likelihood estimators, 
Hov7ever, it should be noted that the CUP is a function of the 
standard deviation and not the variance. Tne following section vail 
determine unbiased estimators for the CEP using the procedure in this 
section. 

3,3 Unbiased "Maximum Likelihood" Estimate Of The CEP; When 

» W “ ^ and P “ 0, 



The maximum likelihood functionof '7'when u^ 



0 i s 



(3,11) VT 




fln-o the sum in (3,11) divided by y* has a chi squared distribution, 
it follows that the square root of a chi squared random variable di- 
vided by its degrees of freedom has a chi distribution. The density 
function of a chi distributed random variable v;ith 2n degrees of free- 
dom in 



(D.12) f^(u) - 



an" i/"'' 



u > 0 



« 0 u ^ 0 

vrhex’G (n) ia the gamma function v;ith parameter n, 

Tlicn, 
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(3.13) E(v;) 



) r 

r {N) fN ' 



and 



vr 



(M 



f (/^* i) / 



IS an 






unbiased estimate of ST and therefore 

(B«14) CriP*- « 1.1774 TJ^ is an unbiased estimate of the CIDP. 



The mziximura likelihood estimator of ST when the means are not 



zero 15 
(B.15) Vg. 



Therefore, 



(3.ie> £(^) . 



.■L f[(x,.-,f,(y,-y)'J 

< '/ 



r(^-i) ^ 

ro^-nfv 



and 






(3.17) Vs. 



x)*{yj-y ri 



is 



an unbiased ectimate of 7” » IIiGrefore 

(-.13) C2P** = 1.1774 ’v^ is en unbiased estimate oi the CilP, 

The reader may be interested in the magnitudes of the biasing 
factors and a comparison of the biased and unbiased estimators o. the 
C^'. The results obtained using the c&ta from tlie sample probl :.n.s are 



presented in Tables i and j 



Table i 


Comparison of the Biasing Factors of 


the Two Estimators 


Case 


CEP** 




1 


3,, =/l0 /7l0) = 1.01 

TTioTsT 


3^, =/Ib r' (9) = 1.09 
r (9.5) 


1 

1 

2 

i 


B,„ =/l5 r (15) = 1.01 
/'(IS. 5) 


; 3^0 =1*15 r (14)= 1.04 

^(14. 5) 


3 


3,. =J1S r (25)= 1.005 
r(25.5) 


=(25 1.03 

/"(24.5) 



Table j 


Comparison of tlie Estimators with 


the Methods Used In Sections 


II and III 


1 Problem j 


Appendi:: 3 


Section II ! 


Appendix B 


Section III 


’ i 

1 




CEP 

1 : 




■gEP** 

2 


CEP^ 


CEP* 

2 


1 

Case 1 


4.20 


1 

3.97 ! 


.. 1 . T 

4.13 


1 

3.86 


3.64 


3.72 


Cose 2 


4.13 


3.84 


4.10 


3.95 


1 3.87 : 


3.88 


Case 3 


3.53 j 


3.43 


3.51 i 


3.29 


1 3.28 


3.26 


2 

Case 1 


3.55 


3.37 


3.50 


3.47 


3.33 


3.55 


Case 2 


3.59 


3.45 


3.51 


3.43 


3.39 


3.52 


Case 3 

1 


3.33 


3.77 


3.78 


3.78 


3.71 


3.74 


i 3 

Case 1 


4.76 


4.17 


1 

! 4.69 


4.76 


4.66 


5.02 


Case 2 


4.10 


’ 3.65 


1 4.03 


3.92 


3.56 


4.21 1 


Case 3 


3.69 


3.36 


3.66 


3.73 


3.52 


3.72 



Comparison Of The Txto Sstimatess Uelative 2fficiency 
Througliout this section it is assumed that -Vy - ^ and P = 0. 

It can be proven that CEP** has f^reater efficiency than any other 
unbiased linear sample statistic ’..’hen the mean value is (0,0). In case 



III 



the mean is not zero but is kno/Ti tu be snail. 



this estimate sho.ilcl be 



considerec!, CZ?^'* is asymptotically efficient whatever the population 
mean may be, hence, if the mean .is greatly different from (0,0), CdP';-* 
will be a better estimate than CEFi"'", llowever, because 2 degrees of 
freedom are lost in estii.iating the coordinates of the mean, the estimate 

V7ill not be as precise as for small values of (.i ,u ), 

z i X y 

In order to determine whether to use CEP** or CE?*"'' when it is 
knovm that the true mean is close to (0,0), it is necessary to compare 
the two estimates by some criterion. The method \.v.icli will be used is 
the ratio of the relative efficiencies, b’hen CEP*" and CEP** are used, 
the formula is 



(B.19) 






This comparison may be done by assuming that the true mean is 

either some point (u ,u ) or (0^0) • In the case that the assumption is 

X' y 

made that the true mean is (u ,u ) the ioint density function is 

X y 



/ 



(B.20) r(x,y; 



ex/?p ^[o\-u,f i-iy-»yT] 



ilheii it is assumed that the true mean is (0,0), the joint density 
function is 



(3.21) g(x,y; 0,0,^ ) - 



I 



3.nv^ 



ex/3 



/ 









The development of the ratio assuming that the true mean is (O^O) follo\7 
the procedure applied in formula (Boll)* The result is 






(B.23) E (V. [^ ) 



Vl/-/) 






V"' 



Combining formulas and D.23), th'3 racio function is 



/ 



(B.24) 



/ 



1% • i" o 



(/ 



f'/ f- 



r(^) n (A/' -D 

r ^ (AJ- i) 



- 1 



ifnen the mean is (0,0), the ratio function in (2,24) is less than 
1 for all n. Table k presents values of the ratio function for 
n = 2(l)20j 25(5)50. P.3. Itoranda tables this ratio for n = 2(1)8. 



Table k 



Values Of The Ratio Function When « u^^ = 0 



n 


R.F, 


n 


2 


• 482 j 


25 


3 


.656 ■ 


30 


4 1 


1 •'7^3 S 


35 


5 


' .795 i 


40 


5 


1 ,830 j 


45 


7 


.854 > 


50 


8 


1 .873 




9 


' .837 




10 


o 

• o .’O 




11 


.908 




12 


,915 ’ 




13 


.922 




14 


.927 




15 


.932 




16 


.937 




17 


.940 




IS 


,944 




19 


.947 




20 


.940 





o if o 

.959 

.960 

,971 

,974 

.977 

.979 



If it is knov-Ti that the true mean is some point (u^s>’-'„) then 
formula (3,20) is the joint density function of the component errors. 
The II, F, ratio for this case x as developeo by P,3, iloranda in reference 



(3). In order to find t>te lean square dovinticn of CJl-**, the sa:iio 

procedure can be follo'.7od as in formula (3.13) and the result is t!ic 

same as fonaula (B.22). The ir.ean square error of CdF** is a function 

of u and u . Ibranda assui-ned for case of computation that u = 1< V 

y :: 1 

and u = k V?” , 

y 2 



Letting u be defined by 

^ 0 . \ 

< I ^ ' 



(B.25) u = 



u has a non- central chi 



squared distribution. Values of .''..F, sho’.'n in Tabl ! 1 'an erpt fro: 
Table (1) in reference (3)) v?ere obtained by putting = k^, and vary- 
ing k from 0(.1)1»3. The results of this derivation shou' tb.at as n 
increases, the ratio function decreases for a constant value of I:. It 
can be ascertained fi-o.;i th.is table that for large n, C .T^‘* uill be the 
best estimte unless I; equals zero and CU?** \7ill be best Tor sn'.all n 
and snail values of !;* The practical use of tbe ratio under these 
assumptions require tl:c use of estimates to obtain the values of k 

1 

and k^ and although not exact, may still supply some useful information^ 



Table 1 



i 




f vr ' r*r 






h 


(c.r^ - 


c-r)2 ) 








1' ( ^ 


0.0 


0.1 


0.2 


r.3 


0.4 


0.5 


0.6 0.7 


0,8 


0.9 


1.0 


_n . 

z 


.482 


.437 


.503 


.530 


.575 


. 939 


.723 .'VO 


1.01 


1.21 


1.45 


3 


,354 


,663 


. So 5 


.727 


.796 


.:o3 


1.05 1,23 








4 


.743 


.731 


,777 


* j A- , 


.910 


1.06 


1.23 








5 


.796 


.804 


.334 


,3-'-3 


1.00 


1 1 0 










a 

•J 


.030 


.838 


.36, 


. ■ 37 


l.Of 


X « *- ‘0 










1 


* 0 o4 


.364 


,3,6 


. '72 


1.12 














. i73 


r> rv / 

9 


.17 


1 ^r\ 

X • ./vJ 


1 . 1 j 
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A possible procedure for using Table 1 is as follows; 

The values of x and y are first computed. Then ^ and ^are com- 
puted using fonnulas (B.ll) and (B.15) respectively. The estimated 
value of and k^ will then equal 



CB.29) 



''z “i 

f 



^ -A 

where v — r 



Using kj^ + 1<2 and n, an analysis of table 1 may show v;hen CEP** 



k- 



is not the best estimate. The reader should be cautioned that no 
attempt has been made to theoretically justify this procedure* 

In order to better illustrate the above, the computed values from 
the example problems for case 1 are shotm in tabic m. 



1 "" 








Table tn 

Conmuted Values of 1 


< 


problem 

Cl.'olO) 




y 


Ibdel 


I 


Ibdel II 


r 




“2 


_ N=i+ I'jViL, 


















1 


1.2 


2,0 


3,5 




3.2 


3.3 


,364 


.605 


.484 


2 


1*0 


.1 


3.0 




3.0 


3.0 


.333 


.033 


.183 


3 


.6 

1 -^^- ■ ^ 


.3 


4.1 




4.2 


4.1 I 


.146 


.073 


.109 



Analysis of Tabic tn Using the Above Values 



1 for k “ ,484» R.F, >1 for all n>5, therefore cip** is best 
estimate. 

2 for k » ,20, R.P, * .917 < 1, for n - 8, therefore CEP** is 
slightly better* 

3 for k “ .109, R.P. - .384 <1 for n « 8. CSP^'* is better. 



115 



] In<:rouuctiOii 

Jhis api^endix diaciasea ..le dcUails of tlrj ix L^^ratioi; { 

in fecizions II a:v: IV» I^.e ints^ration of '.^llipse or a circle over 
the bivariate noit/.al density function .jay ’'G si/plified by .r.akin 3 the 
transformation czcplainec in .Ippendix A, That is ,ro.. equation 



(C.l) 









..;v,v;.Uu-.v 



;:nere 






2 // V I 



exp 



,,2 






ud)! 



d,2 Integration Over Ciicl^ 

Ihe probabilit;^ tliat c raneb. point (i , /) enll lie ^'it in a circle 
u’ith center at the origin and radius k^is \:ritten as 



(C.2) 



o o 

U- 



) 






;V. ,V)gu V 






Iiie teo i ri j strations in uijy-rc f * 1 c'^ov the y^o'netric area of 
integration. 






■‘Circular irror Probabi li ties’* by H® Leon darter /4/ of \eronau« 
t i ca 1 He sea r cb Labo ra to r i e s » 



Ilf 






'H 



A /■ 

I , t 

/ ' ' 

' / 



/ 



/ 



X lI. ' 






Intcsra 


tion 


of the 


3ivar 


ia te 


density ?.nCuion Cve 














Figure CM 


In order 


to 


sir.:pli fy 


equation. 


3 (C*l) anC (C»2) let 




o( 

% 


= T.COG?, 


'Vu 




3in- , 


then 










/■,' 




• ; \ 


r; 






/ 


)\ 


" \ _ _L f^'^cos'^Q -h 






avj 




/ 1 


■ a - 


where 










i'-1<K 




J - 


J < 

1 ji't 


6U 


r: 










idj^ 














i 'J**'' 


D© 

















J 



J/vi c/ 9 



now let (2 - S' 5'/A/‘'£^' / - COS 

A' 



and the pi'obability is 






3 /I ^ / / 

c> o' 



let ~ ^ f 0 ~ 



/ c — 



yc/ 



and toe probability is 



P(Kc)= #- [(cb/) +(C-/)C05 01 i 0/2 

o 5 



7 T 

A' 
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in tic 



j ‘ ■ 






• '' 



l'(^^<) 



1 

o 



/ - C.X/^'- 






(C ‘ ti) + (C '-/) COi / 



?his Jol* i inLe..rat:C' -sin*: the trnposoi-'al rile and utilizing 
coniputers to 'O the intc^^^iawin;j. 

..^or cxai.iple, tlte curve belo ^7 represr>nts so .e fuactioi. that u’o ' i sh 
to integrate over ‘■’i- !esignatcc iut-^rval. can hivicle the interval 

into cc'ual sub intervals (d/*) arr 3*i. all of the srb intervals. As the 
0"b intervals 'u:Co::o S’.allerj the accuracy of this type of integration 
^ecoiaes b/jtter and this susioaticu tec .nique approaches the actual area 
.'nder the curve. 




iP 



A -h 



.’ranozoi'-’nl 'ccluii-.j ..i Inte,,ratin^ Uudci' a by -iriiation 

I'i'urc C.? 



h . ,.or...nlal:iau Tor antc'''ration \;if. u intervale t j. beco.,c-s 






OC:-. :) = 2 : > i--ot c--/4r) tcUi) (cUi) _ 

T ^ Tr-:1) + U-'.l; , 

t-no 



here - = JF 
n 



118 



I 




C' t - . . X i i., . . . .Lic.x XL -w lil^ L' 

. chebys-..*^ ■ ;‘Ol^i. 1 r i- Ow ' r ': ..hat coiivcr^js ..;ore 

ra^aol^' tl.ari l.h I! -..'.iccion. i.his c^ui slvo^r,. h; ^c...pai'iu_ thiC 

convcrp,ence of the t^;o :ieti’oJs. 1l re le. u. = 

:(x) = C 03 ; - - -: (c:;:-) 



_ 'T' /v.N t’l'^n 



r'.ie .^or. v.ila for integration no\: bscornes 



1 _ < ~ vt 

;c.7) A - ” 



n, - - - ‘7 /jac"<; 

// f o (c ^ / i ^ (c “ •■ / j 7^1 (/V j 



fills seT.r;e.ation is nov- . .adc- for different values of I:,c, and 1 ( 1 :, c), 

C *3 Intc_-;ral ir.'j, dvcir ..u ..llipse 

The probability tuat a rando. point (’■//) .ill lie \?ithin an ellipse 
\ 7 it: center cat the or i pin is written as 



V Oj 



•• r •• 
V - 



2 Tl Va 7 ly 



u“^ + v*^ )hiufv 

1 J 









fhe t'.;*o illustrations in fip:..^ J .3 show the pGo...etric area of integration 



\ 



f* . 

I 









' IT i' - 

— \- ■ A- --i 



4-. I 
! i 









1 / c^J</S 



Volume of Integration of 

ivariat^- ^ensi t^ function 



hi I ipse dor. .e b' llanc u ttiug 
g.. ^.(u,v) Parallel to u,v Plane 



fif_urc d ,3 
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I 

I 



tl.ul ii. t*. '.c L. .* cil ^ tM s i: ?.l:x ' ircivlcr, 

the three di-.icnsioiinl £zi... beiu;^ a perfect bell and cac two dir.e.isional 
for.;: beins a circle. 

In order to 'arLh.cr tliis for:* lot 

(Co ") u - 7^^'db^?, V = 7D fihe, then t..o ^probability becomes 

I exp IriJ^^^c/a^ x:here 

2 7TVI, V/ ^ ^ 

m<K 



(C. 10) i ™ 



: 7T 



r ; f /1 1 c/*ri J 9 

* L t ' 



7or.-nula '^.10 can .c inte;^raced directly by first integr^atin^ -.7ith 

res; ect to an.'. L‘ en niuh. respect to After iutc,_^rati:ig with respect 

to the forr.ula becoiees 

/V 



(d-11) 






c-v-(-,AA. .d . 



o 



If \;c let t=..“, thi" prob.':;i li ‘c_, stat.- ent beco- ,es 

K' 

(J.12) .'(Ic,t) - I I ex; t '-‘.t T’hei'C Z {t) is 'c’ao c'..i seuarec- density 

o 

function \7ith 'egruos of fx*e do... o.s defined in for.-.ula (h.5) 



,\sat is 



( :. 13) I ' j - I ; ^ '■ +- Ay < 

o. 4- 1/ T ‘ 



s'l/ 



i. / 

J 



/ ' c , 



For any value of rC!:,t) the value of !-7 can be obtained fror.i table 4 
by enterin'; with i'(h,t) and 2 deyrees of freedc;. . 
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